CHAPTER 12 Comparing Proportions and Analyzing Cross-Tabulations 167

Putting it all together with some

notation and formulas

The calculations of the Pearson chi-square test can be summarized concisely

using the cell-naming conventions shown in Figure 12-6, along with the standard

summation notation described in Chapter 2.

Using these conventions, the basic formulas for the Pearson chi-square test are as

follows:»

» Expected values: Ex

R

C

T

i

N j

M

i j

i

j

,

,

, , ...

;

, ,

...

1 2

1 2»

» Chi-square statistic:

2

2

1

1

(

)

,

,

,

Ob

Ex

Ex

i j

i j

i j

j

M

i

» Degrees of freedom: df

(

)(

)

N

M

1

1

where i and j are array indices that indicate the row and column, respectively, of

each cell.

Pointing out the pros and cons

of the chi-square test

The Pearson chi-square test is very popular for several reasons:»

» It’s easy! The calculations are simple to do manually in Microsoft Excel

(although this is not recommended because the risk of making a typing

mistake is high). As described earlier, statistical software packages like

the ones discussed in Chapter 4 can perform the chi-square test for both

individual-level data as well as summarized cross-tabulated data. Also,

several websites can perform the test, and the test has been implemented

on smartphones and tablets.

FIGURE 12-6:

A general way of

naming the cells

of a cross-tab

table.

© John Wiley & Sons, Inc.